Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications
نویسندگان
چکیده
An analytic signal s(t) is modeled over a T second duration by a pole-zero model by considering its periodic extensions. This type of representation is analogous to that used in discrete-time systems theory, where the periodic frequency response of a system is characterized by a finite number of poles and zeros in the z-plane. Except, in this case, the poles and zeros are located in the complex-time plane. Using this signal model, expressions are derived for the envelope, phase, and the instantaneous frequency of the signal s(t). In the special case of an analytic signal having poles and zeros in reciprocal complex conjugate locations about the unit circle in the complex-time plane, it is shown that their instantaneous frequency ~IF! is always positive. This result paves the way for representing signals by positive envelopes and positive IF ~PIF!. An algorithm is proposed for decomposing an analytic signal into two analytic signals, one completely characterized by its envelope and the other having a positive IF. This algorithm is new and does not have a counterpart in the cepstral literature. It consists of two steps. In the first step, the envelope of the signal is approximated to desired accuracy using a minimum-phase approximation by using the dual of the autocorrelation method of linear prediction, well known in spectral analysis. The criterion that is optimized is a waveform flatness measure as opposed to the spectral flatness measure used in spectral analysis. This method is called linear prediction in spectral domain ~LPSD!. The resulting residual error signal is an all-phase or phase-only analytic signal. In the second step, the derivative of the error signal, which is the PIF, is computed. The two steps together provide a unique AM-FM or minimum-phase/all-phase decomposition of a signal. This method is then applied to synthetic signals and filtered speech signals. © 1999 Acoustical Society of America. @S0001-4966~99!01003-6#
منابع مشابه
Application of Model-Based Estimation to Time-Delay Estimation of Ultrasonic Testing Signals
Time-Delay-Estimation (TDE) has been a topic of interest in many applications in the past few decades. The emphasis of this work is on the application of model-based estimation (MBE) for TDE of ultrasonic signals used in ultrasonic thickness gaging. Ultrasonic thickness gaging is based on precise measurement of the time difference between successive echoes which reflect back from the back wall ...
متن کاملSpeech analysis and synthesis using an AM-FM modulation model
In this paper, the AM{FM modulation model is applied to speech analysis, synthesis and coding. The multiband demodulation pitch tracking algorithm is proposed that produces smooth and accurate fundamental frequency contours. The AM{ FM modulation vocoder represents speech as the sum of resonance signals modeled by their amplitude envelope and instantaneous frequency signals. E cient modeling an...
متن کاملRecognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model
Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....
متن کاملA Novel Sampling Approach in GNSS-RO Receivers with Open Loop Tracking Method
Propagation of radio occultation (RO) signals through the lower troposphere results in high phase acceleration and low signal to noise ratio signal. The excess Doppler estimation accuracy in lower troposphere is very important in receiving RO signals which can be estimated by sliding window spectral analysis. To do this, various frequency estimation methods such as MUSIC and ESPRIT can be adopt...
متن کاملAM-FM estimation for speech based on a time-varying sinusoidal model
In this paper we present a method based on a time-varying sinusoidal model for a robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is modeled as a sinusoidal model with time-varying amplitudes. Specifically, the model makes use of a first order time polynomial with complex coefficients for capturing ...
متن کامل